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ABSTRACT 

The major emphasis of the National Longitudinal Study 
of the High .School Class of 1972 (NLS) has been directed toward 
individuals in the 1972 high school graduating class. Conseguently , 
sampling parameters were established to ensure student 
representation, and the bulk of data collected were indivirtual-level 
data. School-level data exist on a separately documented database, 
and weights appropriate for analyzing these data have been computed. 
The purpose of this report is to define the post-stratification 
adjustment procedure used to develop appropriate school-level 
weights, given an initial school selection procedure designed to 
provide student representation. This report describes the NLS school 
sample design, computation of unadjusted student weights (used in 
determining weights for . individual high school seniors), and 
computation of adjusted weights for school-level analyses: 
(Author/BW) 
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INTRODUCTION 



A. Background 7 

The National Longitudinal Study of the High School Class of 1972 (HIS) is 
* large-scale survey which has been conducted by Educational Testing Service 
-tETS) and the Research Triangle Institute (RTI) for the National Center for 
Education Statistics (NCES). The NLS was designed and implemented to allow a 
better understanding of the development of young adults as they pass through 
the American .educational system and of the complex factors associated with 
individual educational and career outcomes. 

Following a rather extensive period of planning, which included the 
design and field test of survey instrumentation and procedures, a full-scale 
survey was initiated in spring 1972. This base-year survey (Hilton and Rhett, 
1973) conducted by ETS, collected student data onsite in the high schools, 
including high school record information, ..scores on a specially developed test 
battery, and questionnaire responses about high school experiences, background, 
attitudes and opinions, and plans for the future. Other information about the 
high school and its educational programs also was collected in the base year. 

Subsequently, four major follow-up surveys have been conducted by RTI 
fro m October through April or May of the years 1973-74, 1974-75, 1976-77, and 
1979-80 They involved the collection, by mail and personal interview, of 
student questionnaire data including information about activities and accom- 
plishments since the prior survey, attitudes and opinions, and plans for the 
future. Additionally, RTI ha, conducted several supplemental survey efforts 
to collect, retrospectively, key items of information that had not been provided 
during prior major surveys. Methodological details of the several follow-up 
surveys are available elsewhere (Bailey, 1976(b); Bailey, 1976(b); Levinsohn 
and McAdams, 1978; Riccobono, Burkheimer, and Place, 1981). 

B. * Pur pose of this Report • 

As indicated above, the major emphasis of NLS has been directed toward 
individuals in the 1972 high school graduating class.. Consequently, major 
-"sampling parameters were established to ensure student representation, and the 
"bulk of data collected has been individual-level data. Individual-level data 
exists on a well documented data base with appropriate adjusted individual 
sampling weights to allow a large number of individual-level analyses (c.f., 
Riccobono et al . , 1981). 
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Other data at the high school level were collected, however, as indicated 
above, and interest has existed in analyzing school level data and/or student 
data aggregated to the school level. The school-level data exist on a separately 
-documented data base (Henderson and Levinsohn, 1981), and weights appropriate 
-""Tor analyzing these data have been computed and included on the file. The 
Tjurpose of this report is to define the post-stratification adjustment procedure 
used to develop appropriate school-level weights, given an initial school 
selection procedure designed to provide student representation, which included 
oversampling 'of some schools and sampling of some others with probability 
proportional to student population size. The three remaining sections of this 
report describe (1) the NLS school sample design, (2) computation of unadjusted 
student weights (used in determining weights for individual high school seniors), 
and (3) computation of adjusted weights : (for. school-level analyses. 
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II. THE NLS SCHOOL SAMPLE ' 

A. Original Sample Design 

- The sample design may be described as . a deeply stratified ""probability 
Sample of schools. The population consisted of all public, private, and 
-Church-affiliated high schools in the 50 States and the District of -Columbia . 
The sampling frame was constructed from computerized school files maintained 
by the U.S. Of f ice 'of Education and by the National Catholic Education Associ- 
ation (Westat, Inc., 1972). 

The school sampling frame was stratified into 600 final strata based on 
the^ following variables: 

-Type of control' (public or nonpublic), 

-Geographic region (Northeast, North Central, South, and West), 
-Grade 12 enrollment (less than 300; 300 to 599; 600 or more), 
-Proximity to institutions of higher learning, 
-Percentage minority group enrollment, 
-Income level of the community, and 
-Degree of urbanization. 

In order to increase the numbers of disadvantaged students in the sample, 
schools located in low income areas and schools with high proportions of 
minority group enrollments were sampled at approximately twice the sampling 
rate used for the remaining schools. Schools in the smallest grade 12 enrollment 
strata (fewer than 300 seniors) were selected with probabilities proportional 
to their estimated numbers of senior students and without replacement. Schools 
in the remaining strata were selected with equal probabilities, without replace- 
ment. Within each -final stratum, four schools were selected initially and 
then two of the four were randomly selected and designated as the primary 
selections. The other two schools were retained as backup or substitute 
selections and were used in the sample only if one or both of the primary 
schools did not cooperate (e.g., refused, ineligible). 
2 In order to provide for representation of eligible schools that had not 
-been included in *.the lists used in making up the sampligg frame; a subsample 
"of 200 school districts was selected from among the districts represented in 
the primary sample of 1,200 schools. These districts were surveyed to obtain 
names and addresses of all schools with grade 12 enrollment thai had not been 
included on the original sampling frame, and an "augmentation sample" of 23 
schools was chosen from the list of 45 schools so compiled. 



B. The Final Sample 

Of the 1,200 schools that were originally selected, 21 were found to have 
no 12th grade students and 231 refused, leaving 948 primary schools to partici- 
pate in the Base Year survey. Of the replacements that were obtained from the 
iackup list, 122 schools participated, yielding a total of 1,070 base-year 
participating , schools. In 26 cases, school-level data were collected from 
"backup schools on the basis of a preliminary refusal of a primary sample 
school that later agreed to participate. These 26 cases, of which 18 also 
provided student-level data, thus represented "extra" schools (i.e., more than 
the required two schools per stratum). None of the augmentation sample schools 
were included in the Base Year survey (c.f., Hilton and Rhett, 1973). 

Because of the large school nonreponse in the Base Year, before beginning 
the- First Follow-up survey a resurvey effort was initiated to elicit responses 
from the 231 nonparticipating schools and to obtain replacements for all of 
the 21 schools that had no seniors (and therefore could provide no student-level 
data). As a result of this effort, 205 of the original refusal schools agreed 
to participate, together with 35 backup schools. The eight base-year "extra" 
schools from which no student data were obtained were dropped from the sample, 
and 16 of the 23 sampled augmentation schools participated, yielding a grand 
total of 1,318 participating schools in the final school sample. This, of 
course, represents more than the 1,223 schools that were anticipated in the 
original design and, therefore, results in strata with more than two schools. 
Due to the expense involved in collecting student data, however, a post hoc 
sample redefinition was implemented* rather than discarding data already collected 
(c.f., Moore 'and Shah, 1975; Moore, 1975; Shah and Marnell, 1976). The number 
of participating schools in various categories is summarized in Table 1. 



TABLE 1. NUMBER OF PARTICIPATING SCHOOLS BY CATEGORY 



Resurvey and 

— Base Year Augmentation Tot«l 

^Category Participants Additions Participating 



948 205 1,153 

Backup • lo-* 

96 35 131 

Augmentation _ 1_ -if R 

Total 1,070 265 1,318 



Primary 
:kup 
Extr 
Other 



a Backup schools from strata in which both primary schools ultimately partici- 
pated during the base year. 

* For 8 of the extra schools no student data were ever collected; consequently, 
these schools were dropped ^:om the sample. 
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III. UNADJUSTED SCHOOL WEIGHTS 

-v 

For each of the 1,302 schools of the original strata (the nonaugmentation 
schools), the sample inclusion probability, P hi , for selecting -school-i in 
stratum-h was calculated as 

P hi ^ D h A hi /A h » 

where n, = number of participating schools in the final NLS sample for 
stratum-h, 

A - size measure for school-i of stratum-h, 

hi 4 

and a = sum of size measures for all schools in stratum-h. 

h 

For nonaugmentation school strata defined by schools with less than 300 seniors, 
within which PPS sampling had been implemented, the size measure was the 
larger of the estimated number of seniors in the school and 18. For the 
remaining nonaugmentation school strata, within which simple random sampling 
had been implemented, the size measure was one. The use of the number of 
participating schools in each of the original final strata (nj accounts for 
the strata where the number of participating schools was not equal to two 
(c.f., Moore, 1975; Shah and Darnell, 1976). 

In the sample of 200 school districts canvassed during the Base Year to 
identify public schools not included on the sampling frame, a total of 45 such 
schools were identified and 23 were randomly selected as an augmentation 
sample (Hilton and Rhett, 1973). School election probabilities, P., we^ 
calculated for each of the 45 schools, as specified by the NLS sampling contrac- 
tor (Westat, 1972). Based on a joint NCES and RTI decirion, the 16 participating 
augmentation schools were grouped by pairs into eight additional strata (h = 

60 i, 602 608), and sample inclusion probabilities, adjusted for school 

nonparticipation, for these 16 schools were calculated as 

For all of the sample schools, unadjusted weights were computed as recipro- 
cals of their probabilities of selection. Explicitly 



W hi = lP hJ 
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Although the unadjusted weights were appropriate as a base for subsequent 
calculation of individual-level weights, a post-stratification technique was 
Elected for adjusting the nonaugmentation school weights, so that school-level 
Alio estimates based on the actual total number of schools in various categorxes 
could be routinely produced. This procedure reduces bias in school-level 
estimates resulting from the irregularities that had entered into the final 
sample Post-«tratif ication adjustment forces the weighted distribution of 
the participating schools to be the same across the post-strata as the popula- 
tion,distribution. No adjustments were made to the augmentation school weights 
beyond that implicit in their inclusion probability formula, since no further 
.information was available for these schools. 

Accordingly, weighting classes were established based on six of the 
primary stratification variables; namely, grade 12 enrollment, geographic 
region, percentage minority enrollment, income level of the community, type of 
control and proximity to institutions of higher learning. A separate class 
was established for the augmentation schools. Within a nonaugmentation school 
weighting class, the unadjusted weights were summed to obtain an estimate of 
the number of schools" in the class. An adjustment factor was then computed 
for each class by dividing the actual number of schools listed on the sampling 
frame in that class by the estimate far that class. The post-stratification 
adjusted weights were then obtained as the product of the unadjusted weights 
and the corresponding adjustment factor. . 

It should be- noted that the post-stratification adjustment for schools m 
the strata with 300 or more seniors (those drawn with equal probabilities 
within strata) is unity and, thus, h,s no effect on the unadjusted weights for 
these schools. This results from the facts that the schools were drawn with 
equal probabilities within sttata and from the way the school inclusion proba- 
' bilities were defined to account for the use of backup schools. For this 

reason, all- of the la.rge schools were grouped into one weighting' class, class 15. 
.'The remaining nonaugmentation "schools were grouped into the .first 14 weighting 
"classes shown in Table 2. The post-stratification adjustment factors, oge her 
with the number of sample and frame schools and the sums of unadjusted and 
past-stratification adjusted .weights, are shown in Table 3 for both nonaugmen- 
* ration schools and augmentation schools (weighting class 16). 
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TABLE 2 WEIGHTING CLASSES USED IN MAKING POST-STRATIFICATION 
ADJUSTMENT TO NONAUGMENTATION SCHOOL WEIGHTS 



Eeighti-rfg" 
\ Class 



Grade 12 
--Enrollment 
Size Class 1 



Geographic 
Region 2 



SES 
Type 3 



Type of 
Control 4 



Proximity.- No. of 
to Inst, of Sample 
Higher Learning 5 Schools 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 



1 
1 
1 

2 
2 
2 
2 
2 
3 
3 
3 
3 
4 
4 



1 

2 
2 
1 
2 
2 
2 
2 
1 
1 
1 
2 
1 
2 



1 

2 

1 
1 
1 

2 



1 

2 
3 

1 

2 
3 



2,3 



24 
61 
31 
63 
20 
34 
44 
26 
30 
69 

119 
52 
26 
42 

661 



i/l = under 300; 2 = 300-599; 3 = 600 and over. 

2/1 = Northeast; 2 = North Central; 3 = South; 4 = West (U.S. Census definitions); 
blank = not used in definition. 

3/1 = low socioeconomic status; 2 = high socioeconomic status (Based on community 
Income level and percent minority enrollment (see Westat, Inc., 1972); blank - not 
used in definition. 



</l - public; 2 = nonpublic; blank = not used in definition. 



s/i = location in counties (New England only) or SMSAs C0 ° tainin %J,°° 1 "«"' ine 
central-city school districts; 2 = location in other counties or SMSAs containing 
a public 2 or 4-year college or university; 3 = location elsewhere; blank - not 



publ 

used in definition. 
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TABLE 3. WEIGHTING CLASS ADJUSTMENT FACTORS WITH NUMBERS OF FRAME 
AND SAMPLE SCHOOLS AND SUMS OF SCHOOL WEIGHTS 



Post- 

- r No. of No. of Sum of Stratification Sum of 

Weighting Schools Schools Unadjusted Adjustment Adjusted 

Class in Sample in Frame Weights Factor Weights 



1 


. 24 


284 


272.00 


1.04336 


28^.00 


2 


61 


1529 


1457.49 


1.04907 


1529.01 


3 


31 


896 


1049.30 


0.85390 


896.00 


4 


63 


1459 


1440.37 


1.01294 


1459.01 


5 


20 


476 


475.24 


1.00161 


476.00 


6 


34 


1057 


1117.05 


0.94625 


1057.00 


7 


44 


1783 


1723.42 


1.03457 


1783.00 


8 


26 


675 


775.51 


0.87040 


675.00 


9 


30 


364 


318.16 


1.14409 


364.00 


10 


69 


1099 


1177.04 


0.93370 


1099.01 


11 


119 


2422 


2358.23 


1.02705 


2422.01 


12 


52 


1547 


1511.80 


1.0232E 


1547.00 


13 


26 


476 


346.61 


1.37329 


476.00 


14 


42 


1670 


2076.76 


0.80414 


1670.00 


15 


661 


3149 


3149.00 


1.00000 


3149.00 


16 


16 




1474.84 


1.00000 


1474.84 



A final adjustment to the weights of the nonaugraentation schools was 
necessary to account for the fact that the frame included some schools that 
did not have a 12th grade enrollment. This was accomplished by multiplying 
each weight by 1,179/1,200, the proportion of the original primary sample with 
grade 12 enrollment. Table 4 presents the sums of the weights as finally 
adjusted by the eligibility factor (.9825), together with the minimum and 
maximum final weights by weighting class- 
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TABLE 4. WEIGHTING CLASS SUMS OF WEIGHTS AS ADJUSTED^ FOR 
ELIGIBILITY, WITH MINIMUM AND MAXIMUM WEIGHTS 

3 



^Weighting Sum of Minimum Maximum 

1* Class Final Weights Final Weight Final Weight 



1 


279.03 


2.08 


34.62 


2 


15D2.25 


* 8.64 


99.99 


3 


880.32 


1 2.95 


163.13 


4 


1433.48 


3.99 


102.32 


5 


467.67 


9.29 


118.21 


6 


1038.50 


7.12 


125.35 


7 


1751.80 


6.35 


205.39 


8 


663.19 


3.61 


151.84 


9 


357.63 


4.41 


35.30 


10 


1079.78 


3.17 


80.65 


11 


2379.62 


4.72 


98.53 


12 


1519.93 


6.26 


220.90 


13 


467.67 


5.49 


88.38 


14 


1640.78 


6.33 


151.91 


15 


3093.92 


.98 


10.81 


16 


14/4.84 


11.50 


563.50 
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